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PATENT 

Attorney Docket No.: 201 8 1-005000US 
Client Reference No.: PPA-41 

METHOD FOR FAST EXECUTION OF TRANSLATED BINARY CODE 
UTILIZING DATABASE CACHE FOR LOW-LEVEL CODE 
CORRESPONDENCE 

CLAIM OF PRIORITY 
This Continuation-in-part application claims priority from co-pending U.S. 
Patent Application No. 09/505,652, filed February 17, 2000, entitled "System for Improving 
Translation of Software from a Native Computer Platform to a Target Computer Platform," 
which is anon-provisional of U.S. Provisional Patent Application Nos. 60/120,348, 
60/120,376, 60/120,380, 60/120,457, 60/120,458, 60/120,459, and 60/120,504, all filed 
February 17, 1999; each of which is incorporated herein by reference as if set forth in full in 
this document. 

CROSS-REFERENCES TO RELATED APPLICATIONS 
This Continuation-in-part application is related to co-pending U.S. Patent 

Application No. _/ (Attorney Docket 20181-49), filed April 18, 2001, entitled 

"Method and Apparatus for Preserving Precise Exceptions in Binary Translated Code; "U.S. 

Patent Application No. _/ (Attorney Docket 20181-55), filed April 18, 2001, entitled 

"Method For Emulating Hardware Features Of A Foreign Architecture In A Host Operating 

System Environment;" and U.S. Patent Application No. __/ (Attorney Docket 20181- 

51), filed April 18, 2001, entitled "Method for Effective Binary Translation Between 
Different Instruction Sets Using Emulated Supervisor Flag and Multiple Page Tables" each of 
which is incorporated herein by reference as if set forth in full in this document. 

BACKGROUND OF THE INVENTION 
There are a wide variety of commercially available microprocessors (often 
referred to as "processors") that a system designer may select for a particular application. 
Many of these processors are based on different design philosophies such as reduced 
instruction set computing (RISC) versus complex instruction set computing (CISC) and on 
different length of an instruction word (32-bit versus 64-bit versus 128-bit). In addition to 
different design philosophies manifested by the instruction set, various other architectural 
differences may be found in various processors. For example, some processors may execute 
instructions in parallel while others do not. Other processors may use shared memory while 
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others do not. Although there are a number of commercially processors based on competing 
and often incompatible architecture, the well-known x86 architecture is currently a popular 
architecture for which a tremendous amount of computer software has been developed. 
However, because of the necessity to maintain backward compatibility with less powerful 
5 processors, the x86 suffers from a lack of performance when compared to processors based 
on different architectures not constrained by the need to run legacy software and maintain 
backward compatibility. 

When legacy (or base) software is designed, it is typically written for a 
particular processor and computer system based on a particular architecture. And while it is 
1 0 possible to port foreign software to a new, but more advanced, architecture, it is a difficult 
task that requires a complete understanding of both the legacy architecture and the new 
architecture. Further, there are times where the source code, the human readable version of 
the foreign software, is not available. In such instances, it is often very difficult to accurately 
recreate the original functions of the software on the new architecture. This difficulty arises 
1 5 because of the various nuances the foreign software often incorporates to take advantage of 
the features of the legacy architecture. 

Clearly, it is useful to automate the translation of foreign software originally 
developed for a legacy architecture to a computer system, or processing "platform," having 
an architecture that is different from that for which the software was originally designed. 
20 Translation allows foreign software to execute on computer systems other than the one it was 
originally written and therefore increases the commercial value of the software. Further, 
translation can extend the life of the software past the life of the computer system for which it 
was originally designed. Clearly, it would be desirable if the translation process were 
automated to reduce, or eliminate, the cost to port foreign software to another architecture. 
25 Tne process of translating software from a foreign architecture to a native 

architecture is often referred to in the art as binary translation and various binary translation 
techniques are well known in the art. In general, binary translation converts binary foreign 
code, that is legacy software originally written and compiled for a specific architecture, to 
host code capable of being executed by a processor and computer system that is based on a 
30 different architecture. More specifically, binary compilation is the process of detecting each 
instruction and converting these instructions to one or more equivalent native operations. 
While it is often impossible to achieve a one-to-one translation of each instruction due to 
architectural differences, the translation process is especially difficult where the data 
constructs and memory addressing schemes differ between the foreign and native or target 
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architectural platforms and where the source code is not available. Accordingly, it is often 
necessary to analyze binary foreign code (rather than source code) to determine the most 
efficient equivalent instruction or instructions for achieving the same result when executed on 
the new architecture. This analysis and translation process is typically slow and labor 
intensive. Accordingly, any reduction of the need for the analysis and compilation not only 
has an important bearing on binary code performance but also on the cost to port the code to 
the native platform. 

By its very nature, software tends to be complex and must execute with a 
virtually a zero tolerance for errors. Unfortunately, certain areas of software translation are 
extremely difficult. Typically, these involve specialized instructions that deal with hardware 
components, high-performance instructions, addressing modes, input/output and other aspects 
that are closely tied to the hardware developed for the legacy architecture. 

In some prior art binary compilation schemes, the target platform typically 
includes a foreign engine adapted to execute foreign code instructions and a native engine 
designed to execute translated binary code (host code). This type of scheme translates certain 
foreign instructions during the compile process into a corresponding set of native binary code 
while other foreign instructions are typically not translatable. Where it is not possible to 
readily translate foreign code into host code, the target platform must be able to switch 
between the foreign engine and the native engine during run-time execution. Since, 
translated binary code needs to run as fast or faster on the target platform than the foreign 
code did on the foreign platform, it is often unacceptable to constantly switch between the 
two engines during run-time. For this reason, it is desirable to more fully translate the 
binary foreign code to host code prior to run-time. 

Thus, it is desirable to provide a method that achieves efficient and accurate 
translation of software so that it can be executed to operate on a variety of computer 
platforms without the need of expensive manual translation. Further, it is desirable to provide 
a method that achieves efficient and accurate translation of software so that it can be executed 
on a variety of computer platforms without have to switch between a foreign engine and a 
native engine. 

SUMMARY OF THE INVENTION 
The present invention relates to a method for efficiently executing translated 
binary code on a computer platform. More specifically, the present invention relates to a 



method that reduces the time for analysis and compilation of foreign code to host code. The 
method uses low-level code correspondence checking during binary runtime translation 
process. Advantageously, the method does not require obvious translation of foreign code 
into host code. 

The present invention improves the efficiency of the binary translation process 
by correlating previously translated binary host code to foreign code. This approach 
eliminates the repetitive process of translating the same foreign code more than once. In this 
approach, the translation process builds a database of translated foreign code that a software 
layer checks for correspondence whenever the foreign code is subsequently accessed. The 
database of binary code is preferably stored on a hard disk. 

In one preferred embodiment, as each sequence of foreign code (that is an 
instruction or a group of instructions) is acquired from secondary storage, correspondence is 
determined between the foreign code and the cache of host code. If corresponding host code 
is included in the database, it is transferred to system memory for execution. If the host code 
in the database does not include corresponding translated binary code then a translation 
process, which uses various compilation techniques, is invoked to translate the foreign code 
to host code. Newly compiled code is then added to the database so that it may be 
subsequently accessed should the need arise. Thus, the present invention eliminates the slow 
and repetitive process of translating the same foreign code more than once. In this manner, 
the host system may execute translated binary code in the native environment and achieve 
comparable or better performance than the foreign code executing on the foreign system. 

To further improve efficiency, the present invention also maintains the 
database so performance is not decreased. Thus, if a portion of the code in the database is not 
frequently accessed, it can be removed from the database or moved to an archival database. 
Further, if a portion of the code has not been accessed within a selected period of time, it can 
be removed from the database. If the removed portion of code is subsequently required, it 
will then simply be re-translated. 

As will be appreciated by one skilled in the art, most operating system and 
application programs are initially stored in a secondary storage device such as a disk drive. 
When a user selects the program for execution, the program's code is transferred from the 
disk drive to faster system-level memory. For larger application programs or operating 
systems, only a portion of the code may be transferred to memory as required by the 
particular task to be performed. Indeed, it is common for repeated disk accesses to transfer 
code from disk to system memory as the program performs a plurality of tasks. Each time the 
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program swaps code out of a portion of memory and replaces it with new code, the host 
computer system must make sure that the proper host code is executed. During these transfer 
operations, the present invention solves the problem of how to rapidly translate foreign code 
into host code without degrading program performance. 

BRIEF DESCRIPTION OF THE DRAWINGS 

Figure 1 illustrates a representative embodiment of a host computer system; 

Figure 2 illustrates a more detailed representation of the host computer system 
illustrated in Figure 1; 

Figure 3 is a flow diagram illustrating a correspondence checking algorithm; 

Figure 4 illustrates a flow diagram for using a database cache for minimizing 
the need to translate foreign code at run-time; and 

Figure 5 conceptually illustrates one preferred database approach for reusing 
translated binary code. 



DESCRIPTION OF THE SPECIFIC EMBODIMENTS 
The present invention relates to an apparatus and method for executing foreign 
binary code on a host computer. More particularly, the present invention relates to an 
improved apparatus and method for efficiently executing foreign code on a host system in 
real time. In the following description of preferred embodiments, reference is made to the 
accompanying drawings that form a part hereof, and in which is shown by way of illustration 
a specific embodiment in which the invention may be practiced. It is to be understood that 
other embodiments may be utilized and that changes may be made without departing from the 
scope of the present invention. For purposes of illustration, the following description 
describes the present invention as used with computer systems based, in general, on a RISC- 
based processor. However, it is contemplated that the present invention can also be used as a 
part of computer systems having multiple such processors or having CISC-based processors 
or processors based on explicit instruction level parallelism. It will also be apparent to one 
skilled in the art that the present invention may be practiced without the specific details 
disclosed herein. In other instances, well-known structures and techniques associated with 
the described processor or computer system have not been shown or discussed in detail to 
avoid unnecessarily obscuring the present invention. Reference will now be made in detail to 
the preferred embodiments of the invention, examples of which are illustrated in the 



accompanying drawings. Wherever possible, the same reference numbers will be used 
throughout in the drawings to refer to the same or like components. 

Referring now to Figure 1, a host computer system 100 is illustrated. Host 
computer system 100 comprises a computer processing unit (CPU) 102, a memory 
management unit (MMU) 104 and a storage device such as disk drive 106. Host CPU 102 
comprises one or more execution units 108 and a cache 110. Execution units 108 include 
logic to input and retrieve address pairs from cache 1 10 to facilitate the execution of 
translated binary code. When translating foreign code, execution units 108 input a foreign 
address to the cache and retrieve a corresponding host address for use during execution of 
translated binary code. The CPU 102 also includes a register file 1 12. MMU 104 includes 
logic to form in system memory a foreign virtual memory space 1 16 and a host virtual 
memory space 118. The MMU 104 also includes a translation lookaside buffer (TLB) 120 
designed to provide coherence between foreign code in foreign virtual memory 1 16 and 
translated binary code in host virtual memory 118. Any access of foreign virtual memory is 
handled by TLB 120. The dual virtual memory spaces 1 16 and 1 18 are used to maintain the 
content of the foreign virtual memory in a state consistent with the foreign architecture during 
execution of the foreign code. 

In accordance with the present invention, host computer system 100 translates 
foreign binary code at run-time. As used herein, foreign binary code means computer 
instructions written for a foreign processing system but ported to host computer system 100. 
A foreign processing system may, by way of example, be a computer processing system 
based on an Intel x86 processor, a Motorola 68xxx processor or a Sun Sparc processor. 
Compounding the difficulties of executing foreign code on the host is the differences in 
architecture between the host computer system 100 and the foreign system. 

In order to execute foreign code, host computer system 100 must first translate 
foreign binary code to equivalent host code. The translation process rebuilds foreign binary 
code using a host translator so that the translated binary program provides a sequence of host 
instructions that reproduce the behavior of the foreign program on the foreign architecture. 

To minimize performance penalties associated with executing translated 
binary code, host CPU 102 maintains the same data representation and processing as the 
foreign architecture using host registers 1 12. The host computer system 100 has certain 
foreign architecture independent features and some platform specific features to minimize 
performance degradation. All of the foreign hardware features that do not influence 
performance are preferably implemented in software. 



Referring now to FIG. 2 shows a preferred block diagram of computer system 
100 adapted to executing foreign operating system and application code. Computer system 
100 comprises host CPU 102 and MMU 104 together with hardware support for efficient and 
reliable execution of translated binary code. Through a software layer 206, CPU 102 
operates to execute foreign code on the host system. Software layer 206 functions as a 
minimal operating system for host computer system 100 when executing foreign code. 
Software layer 206 further includes the database management code. The process of decoding 
and semantic substitution is fulfilled in binary translator software. Foreign code is transferred 
from disk drive 106 and maintained in foreign virtual space 116 while the host processes are 
maintained in host virtual space 118. After semantic substitution for foreign operations in 
terms of host operations, the intermediate representation is processed by an optimizing binary 
translation process 202 to improve performance. More specifically, software layer 206 
functions as a basic operating system for the host platform. Its primary function is to interface 
the binary translation system with hardware-specific features of the host platform. So as to 
minimize the size of software layer 206, many features, which are inherent to an operating 
system, have been omitted. For example, the host platform software layer 206 does not 
include support for a file system, process management or virtual memory management. 

The software layer 206 functions in conjunction with hardware support 
features to achieve correct execution of binary translated foreign code. The hardware support 
features include a register that points to a bit string maintained in host virtual memory. Every 
bit in the bit string corresponds to a 4K page of memory. As will be appreciated by one 
familiar with the x86 platform, the x86's "physical memory" is divided onto 4K pieces. 
When other platforms are emulated, the page size will be different. It will be further 
appreciated that the length of the bit string is equal to the size of x86 memory. If a bit is set, 
the corresponding physical memory is locked and write access will be denied. Thus, any 
attempt to modify a locked memory location will cause a hardware exception. This hardware 
register tracks all modification to the foreign code and thus maintains correspondence 
between foreign and binary translated code. The register is maintained on the host, because if 
the protection bit were added to the foreign page table, it would violate the contents of the 
foreign memory. In this manner, a foreign operating system and foreign applications written 
for the foreign operating system (e.g., x86 software) can be executed on the host computer. 

The present invention provides a computer that is a virtual x86 computer 
where a user can install any x86 software. The host operating system is not a full operating 
system because it uses the features of the foreign operating system. The host operating 
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system is simply tasked with servicing hardware and software exceptions specific to the host 
system and interfacing to the system hardware for the binary translation system. As will be 
appreciated, the present invention comprises a "software layer" plus a "binary translation 
system" plus the "platform" results in a "virtual microprocessor" adapted to execute foreign 
operating system and application code. 

After the binary translation process, the binary translated image of the foreign 
code will be available for execution by the host processor. All foreign data remains in the 
foreign virtual space. Moreover, there could be constants in the original foreign code, which 
the binary translated code might have read. One solution would be to analyze the foreign 
code to find all memory accesses and put all data and constants into the host virtual memory. 
However, this analysis would have to correct all address constants in the code and insert 
special checks for load and store operations etc. Accordingly, it is desirable to maintain the 
data and constants in the foreign virtual space. Thus, as the translated binary code executes, 
the behavior of the foreign virtual space looks as if it is being updated in real-time just as if it 
were executing on the foreign platform. 

Foreign code execution on computer system 100 begins with transferring 
foreign code from disk drive 106 to foreign virtual space 116. After binary translation of the 
foreign code, all the binary translated pages in the foreign virtual space are write-protected so 
any subsequent write accesses will cause an exception. This protection mechanism maintains 
coherence between the foreign code and the translated binary code. Computer system 100 
also includes a dynamic binary translation process 203 and a dynamic analysis process 204. 
Dynamic binary translator 203 is used as a fast interpreter of a foreign code for two purposes. 
First, it enables immediate execution of the foreign code even if there is no pre-existing 
translated binary code in database 208 and to prepare information for the optimizing binary 
translator. Second, dynamic binary translator 203 is used in any recovery process for precise 
interrupt maintenance. 

Dynamic analysis process 204 functions like a monitor in the binary 
translation system. Dynamic analysis process 204 helps not only to control execution of 
translated binary code, but also to process all exceptions properly, and to invoke optimizing 
binary translator and provide it with profile information. Dynamic analysis process 204 also 
includes memory management functions relating to maintaining translated binary code 
compaction in memory and support coherence with the foreign code. Dynamic analysis 
process 204 is also responsible for processing special situations during execution of the 
translated binary code that were not discovered during binary translation. For example, self- 



modifying code, newly created code and exceptions are all situations that may not be 
discovered at binary translation time. All new information collected by the dynamic analysis 
process 204 is saved for further utilization by the optimizing binary translation process 202. 

Dynamic binary translation process 203 and optimizing binary translation 
process 202 work simultaneously. As binary translator 202 translates foreign code in 
optimized mode, dynamic binary translator 203 translates the code in fast and simple mode. 
When optimized translated binary code is ready, the control switches over to the optimized 
code at a coherent point for execution. The dynamic binary translator 203 is also used by 
dynamic analysis 204 to respond to any discovered special situations. Scheduling instruction 
execution is a function of binary translator 203. 

In accordance with the present invention, a method for executing translated 
binary code is provided. As noted above, foreign code is stored on disk drive 106 until it is 
transferred to memory 118. Typically, an entire sector of the code is transferred to memory 
1 18 in a single operation. 

During the transfer process, the method of the present invention uses low-level 
code correspondence checking during the runtime binary translation process where a 
sequence of low-level code, as will be understood in the art, is a sequence of basic machine 
operations. 

Each sequence must be translated using optimizing binary translation process 
202, dynamic binary translation process 203 and dynamic analysis process 204 in order to 
obtain optimal performance on host computer system 100. Where there is constant swapping 
of code in foreign virtual space 1 1 8, performance can degrade while processes 202-204 
translate the code. Further, in some instances, processes 202-204 could be repetitively 
translating a limited number of code segments resulting in an inefficient utilization of host 
computer 100 resources. 

To improve performance, code database 208 functions in conjunction with 
software 206 to minimize the necessity to translate code each time a swap occurs. As each 
sequence of binary operations is transferred to memory 1 16, corresponding translated code is 
transferred to host memory 1 18 for execution by the host computer 102 while bypassing 
processes 202-204. Thus, rather than translating the sequence each and every time foreign 
code is swapped into foreign virtual space 1 16, the corresponding host code stored in code 
database 208 is accessed on an as-needed basis and moved directly to active system memory 
of the host computer. With the corresponding host code in memory, processes 202-204 need 
not perform their translation functions thereby improving the utilization of system resources. 
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In operation, when a foreign application program is selected, computer system 
100 must load code into system memory. Accordingly, as a first step, a sequence of foreign 
code is acquired from disk drive and transferred to foreign virtual space 116. During the 
transfer operation, software 206 determines if corresponding translated binary code is already 
5 available in database 208. If software 206 locates corresponding translated binary code in 
database 208, that code is transferred to memory 118. If the database 208 does not contain 
corresponding code, then the method of the present invention provides for generating 
translated binary code by invoking processes 202-204. As soon as the binary host code is 
present in memory 118, execution units 106 may execute the code. 

10 The problem of fast execution of translated binary code can be further 

improved by optimizing the compiler but this approach requires substantial understanding of 
both the foreign and host platforms. Accordingly, the present invention teaches a method for 
setting the correlation between a foreign code and the correspondent translated binary code to 
exclude repetitive translation of the same code. 

15 A method for determining correspondence between the memory content of the 

host and foreign virtual spaces is illustrated in the flow diagram of Figure 3. This method is 
initiated by computer system 100 when self-extracted codes (i.e. code that is located in 
packed format on hard disk and which is unpacked after being loaded into the memory). The 
method may also be initiated when the binary translation system tries to jump into a foreign 

20 code page in memory where the corresponding translated binary code is not yet in memory. 
At step 302 software layer 206 obtains the entry point address and hence the relative entry 
point to the foreign code page. Then as indicated at step 304, a search is conducted for the 
foreign entry point in a switch table. The switch table contains pairs of addresses in memory: 
foreign address in the foreign memory and a corresponding address in the host memory for 

25 binary translated code. If search is successful (step 306), software layer 206 jumps to the 
binary translated code in host virtual space and begins to execute as indicated at step 308. 

If the search in step 306 is not successful, software layer 206 calculates a 
digital signature for foreign code page as indicated at step 310. The digital signature 
uniquely identifies the binary translated code and may be either a disk sector address or may 

30 be calculated using a hashing algorithm. Using the digital signature, software layer 206 
searches the database for binary translated code with the same relative entry point and the 
same signature as indicated at step 312. 

If the database search is successful (step 314), software layer 206 then 
compares at least a portion of the foreign code sequence in memory (foreign virtual space) 
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with a portion of the foreign code stored in database (step 316). The verification step 316 is 
facilitated, in one preferred embodiment, where database 208 includes at least a portion of the 
foreign code sequence associated with the binary translated code. Both the foreign code 
sequence and the binary translated code are stored in the database. Thereafter, software layer 
5 206 loads the binary translated code into host memory as indicated at step 318 and control is 
transferred to the newly loaded code as indicated at step 308. 

If, however, the search at step 314 is not successful, software layer 206 
generates a binary translation image as indicated at step 320. The translation process invokes 
the binary translator processes to generate the binary translated image for storage to the 
10 database. As a final check at step 322, software layer 206 verifies that the foreign code does 
not include an indirect jump. If an indirect jump is detected, the process flow proceeds back 
to step 302. If an indirect jump is not detected, the process flow proceeds to steps 318 and 
308. 

The algorithm illustrated in Figure 3 recognizes self-extracted codes (i.e. 

15 codes that are located in packed format on hard disk and which unpack themselves after they 
have been loaded into the memory) by use of a hashing algorithm. In an alternative 

£ = embodiment, the storage location of the foreign code on disk is used to uniquely identify the 
code. Instead of calculating the digital signature (step 310) using the hashing algorithm, 
software layer 206 checks for correspondence between a sequence of foreign code in foreign 

20 memory and the same foreign code on the hard disk. Then, software layer 206 checks for 
correspondence between the foreign code in the foreign memory and its binary translation 
image in the host memory. In this manner, software layer 206 ensures correspondence 
between the foreign code on the hard disk and the binary translated image but software layer 
206 must trace all transfer of foreign code from the hard disk and all changes of code on the 

25 hard disk. Further, this approach does not recognize self-extracted codes so it is possible to 
suffer some performance penalties with this alternative embodiment. 

Referring now to Figure 4, another preferred flow process in accordance with 
the present invention is illustrated. Specifically, when a foreign application program or 
operating system is executed, foreign code must be loaded into system memory. 

30 Accordingly, as a first step, step 402, a sequence of foreign code is acquired from the disk 
drive or other storage media under control of software layer 206. A sequence is preferably a 
page of memory (4k bytes) although one skilled in the art will understand that a sequence 
could be either more or less depending on engineering considerations and the foreign 
platform. 
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During the transfer, software 206 in step 404 identifies the code sequence such 
as by calculating a hashing function or determining the disk location where the code is stored. 
In addition, software 206 may further compare the code sequence to a foreign code sequence 
stored in the database together with the binary translated code where the foreign code 
5 sequence in the database is the code (or a portion of the code) that was previously translated. 
Once the code sequence is identified, software layer 206 then determines, as indicated at step 
406, if translated binary code that corresponds to the sequence of foreign code is available in 
database 208. 

If software 206 locates the corresponding translated binary code in the 

10 database, the foreign code is transferred, as indicated at step 408, to the portion of reserved 
system memory referred to as foreign virtual space 116 (see Figure 2). Then, software 206 
proceeds with the transfer of translated binary code from the database to system memory 
referred to as host virtual space 118 (see Figure 2) as indicated at step 410. If the database is 
resident on a separate physical storage device, the transfer to host virtual space may proceed 

1 5 substantially in parallel with the transfer of the foreign code. Once translated binary code is 
in memory, host computer system 100 may begin execution of the code as indicated at 412. 

If, however, software 206 is unable to identify translated binary code that 
corresponds to the sequence of foreign code, the method of the present invention proceeds 
from step 406 to step 414 where foreign code is transferred to foreign virtual space. Once the 

20 code is transferred, the method of the present invention then proceeds to compile the 

sequence of foreign code to obtain the translated binary code. This translation process is 
indicated at steps 416 through 422. More specifically, as indicated at step 416, binary 
translating process 202 performs a semantic substitution for foreign operations in terms of 
host operations to generate an intermediate representation. 

25 The intermediate representation is then processed by an optimizing binary 

translation process 202 to improve performance as indicated at step 418. Then, dynamic 
analysis process 204 is responsible for processing special situations that were not discovered 
during binary translation. Information collected by the dynamic analysis process 204 may be 
used by binary translation process 202. 

30 Upon completion of process steps 416-420, the translated binary code is 

transferred to system memory as indicated at step 422. Software 206 may selectively store 
the translated binary code to corresponding to the foreign code to database 208 as indicated at 
step 424 before transferring control to step 412. When this sequence of foreign code is 
subsequently accessed from the storage device, software 206 will detect that corresponding 
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code is available and it will be loaded directly from the database 208. Obtaining code from 
database 208 is more efficient than invoking process steps 416 to 424. 

As noted above, software 206 maintains correspondence between the selected 
foreign code and translated binary code located in database 208 by using either a disk 
5 indexing technique or a hash. Once code is requested, software 206 loads the selected 

sequence of translated binary code from the database module into system memory. It will be 
appreciated that if database 208 is stored on a second storage device, such as a second disk 
drive (not shown), the transfer of translated binary code from the second disk drive may 
proceed in parallel with the transfer of the foreign code from storage device 106. In this 

10 manner the transfer of translated binary code may be initiated with only the minimal delay 
necessary to determine the location of the corresponding code in the database. 

Alternatively, when foreign code is being loaded from disk, the disk sector 
where the code is stored is detected by software 206. Using the disk sector as an "address" or 
index into the database, software 206 will check the database for a block of translated binary 

15 code at the address. In this embodiment, the translated binary code is maintained in database 
208 as an ordered list. If the check of the database by software 206 finds valid code, the 
correspondent translated binary code is loaded into the memory and executed. Whenever a 
write is detected to a disk sector, that "address" is marked in database 208 as invalid because 
the foreign code will have been changed on storage device 106. Accordingly, the 

20 corresponding translated binary code is excluded from the database and it must be recompiled 
before being executed. 

When the translated binary code in the database exceeds the available space 
available for storage, selected code can be removed based on usage monitoring or on age of 
the code since it was last accessed. Software 206 is responsible for grooming the database to 

25 minimize the size and access times. 

Another method of recognizing the foreign code uses hash coding for 
determining an associative location in database 208. More specifically, a hashing function 
takes a portion of the foreign code sequence to generate an address in database 208. In one 
embodiment, the hashing function uses each foreign instruction to identify the location in 

30 database 208 where corresponding translated binary code is stored. While this granularity 
ensures the most flexibility, it is preferred that at least each page of memory be accessed as a 
module so as to minimize the possibility that the hash result will not be unique. One 
preferred hashing algorithm is the MD5 algorithm such as disclosed in Rivest, R., Network 
Working Group, Request for Comments: 1321; MIT Laboratory for Computer Science and 
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RSA Data Security, Inc., April 1992, which is incorporated herein by reference for all 
purposes. The MD5 algorithm is a public domain message-digest algorithm that takes as 
input a message of arbitrary length and produces as output a 128-bit "fingerprint" or 
"message digest" of the input. The MD5 algorithm is sophisticated enough that it is 
5 computationally infeasible to produce two messages having the same message digest, or to 
produce any message having a given pre-specified target message digest. The MD5 
algorithm is intended for generating unique digital signature applications but is used in this 
embodiment as the hashing algorithm to generate the index into the database. Other, faster 
hashing routines faster may be used where the amount of code stored in database 208 does 

1 0 not justify use of the MD5 algorithm. 

When hash coding is used for foreign code recognition, it may be useful to 
calculate only a portion of the entry point into the database code where the length of the code 
used to by the hash function is equal to physical page size of the foreign architecture. If the 
hash results in a unique database location, then foreign code recognition may be expanded 

15 using an additional length of foreign code thus permitting deeper optimization in selecting 
code segments from the database. In any event, the database will typically contain translated 
binary code (host code) and a corresponding hash value for the foreign code. The database 
may also contain a portion of the foreign code to facilitate identification of the foreign code 
sequence. For example, a beginning and ending portion of the code could be retained in the 

20 database for future reference. The database may also include relocation information that can 
be used when transferring code from the disk to host virtual space. The relocation 
information is associated with the translated binary code although it may be retained in a 
different database structure. The database may also include additional information that 
facilitates verification that specific code was translated by the optimizing compiler and an 

25 address switch table for identifying entry points for the translated binary code. The database 
may be a relational database or two, flat, linked databases one of which contains the hash 
table and additional information for necessary for code recognition and an index into the 
second base. The second database contains only optimized binary translated images with 
additional information for the loader. 

30 The database can be located at the same hard disk (part of the disk) which is 

used by foreign operating system (more precisely, by its translated binary version). But in 
this case all accesses to the hard disk must be monitored by software 206 so that the database 
remains invisible to the foreign operating system. To prevent corruption of the database, 
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software 206 must intercept each disk access generated by the foreign operating system to 
prevent unauthorized writes to the database. 

Referring now to Figure 5, the present invention is conceptually illustrated. 
When there is a control transfer to a sequence of foreign code as indicated at 502, the present 
5 invention determines whether the address has already been processed and whether the 
translated binary version 504 is already located in the host memory. Each sequence of 
foreign code 502 includes a plurality of operations or instructions such as indicated by parts 
1-4. The determination is carried out by the software layer, which is indicated at 506. If the 
translated binary version 504 is in memory, computer system 100 can execute translated 

10 binary code where corresponding operations are indicated by parts 1-4. If the foreign code is 
absent from memory, the present invention calculates the hash value for the foreign code and 
transfers control to the database management portion of software 206. After the hash value 
and entry point address into the database is determined, the database management subsystem 
tries to determine if corresponding translated binary code exists in the code database. If the 

15 code is found, more precise checking is not required and the translated binary code is loaded 
into the host memory. If there is no code in the database, the foreign code is translated by the 
binary translation process and placed, first, into the host memory. After it is optimized, it is 
placed into the database. 

If the translated binary code was constructed for a wider range of foreign code 

20 than is present in the foreign virtual space, the translated binary code is still useable. For 
example, if one page of foreign code were swapped out of memory, there would be a hole 
that would have to be filled with new foreign code. But since the translated binary code is 
still available in memory, the binary code may be nevertheless used after a check of 
correspondence part of foreign code command. By way of illustration, the translated binary 

25 code in host virtual space 1 1 8 is still executable if the corresponding sequence of foreign 

code (call it a region) occupies only one page in memory. If, however, a region occupies two 
or more pages in memory and translated binary code for this region already exists, it is still 
executable. However, if at some point in time during operation the foreign operating system 
(that is, the translated binary operating system) swaps one of the pages containing a part of 

30 the region out of memory, there is no longer correspondence between the host virtual space 
118 and foreign virtual space 116. In this event, the translated binary copy of the region 
would be incorrect, because in foreign system there will be trap when any instruction tries to 
access the swapped page. Accordingly, the present invention inserts into the translated 
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biliary code a "check operations" on page boundaries to guarantee the same behavior as in 
foreign system. 

Accordingly, in summary, the present invention provides, in one embodiment, 
a method for binary translation implemented on a computer system. The method comprises 
5 the steps of storing host code to a database; loading at least a portion of the host code from 
the database into the computer; controlling the correspondence between a current foreign 
code and database code; controlling the correspondence between the foreign code and 
creation of newly translated code for storage in the database for future reference; and 
compiling the foreign code to the host code if no corresponding code is available in the 

10 database. In another embodiment of the present invention, the efficiency of the binary 
translation process is improved by correlating previously translated binary host code to 
foreign code. This approach eliminates the repetitive process of translating the same foreign 
code more than once. During runtime execution, a software layer intercepts foreign code to 
check for correspondence between the foreign code and previously translated binary code. If 

15 correspondence is found, a set of translated binary code corresponding to the foreign code is 
transferred to the computer processor for execution. To minimize the translation process, 

■ " translated binary code is stored on the hard disk or other memory storage device. Once 

correspondence is determined, the code is moved from the database to system memory. This 
embodiment provides a method for binary translation implemented on a computer system 

20 comprising the steps of storing host code to a database; loading at least a portion of the host 
code from the database module into the computer; controlling the correspondence between a 
current foreign code and database-located host code module; controlling the correspondence 
between the foreign code and creation of an absence module; and compiling the foreign code 
to the host code module. 

25 The above-described embodiments achieve substantial improvement in 

executing foreign code on a host system. Although the improvement will vary depending on 
the foreign code (i.e., x86 versus Sun Sparc), how often code is accessed (i.e., hot spots in the 
foreign code) and the amount of the code in these hot spots, improvement can be on the order 
of two times the results achieved from run time dynamic binary translation execution without 

30 the database support. 

While certain exemplary preferred embodiments have been described and 
shown in the accompanying drawings, it is to be understood that such embodiments are 
merely illustrative of and not restrictive on the broad invention. Further, it is to be 
understood that this invention shall not be limited to the specific construction and 
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arrangements shown and described since various modifications or changes may occur to 
those of ordinary skill in the art without departing from the spirit and scope of the invention 
as claimed. 
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